WO 96/24673 




POYUS96/02153 



-103- 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: LeukoSite, Inc. 

(B) STREET: 215 First Street 

(C) CITY: Cambridge 

(D) STATE/PROVINCE: Massachusetts 

(E) COUNTRY: U.S.A. 

(F) POSTAL CODE /ZIP: 02142 
<G) TELEPHONE: (617) 621-9350 
(I) TELEFAX: (617) 621-9349 

( i ) APPLICANT/ INVENTOR: 

(A) NAME: Michael J. Briskin 

(B) STREET: 28 Harbell Street 

(C) CITY: Lexington 

(D) STATE/PROVINCE: Massachusetts 

(E) COUNTRY: U.S.A. 

(F) POSTAt CODE/ ZIP: 02173 

( i ) APPLICANT / INVENTOR : 

(A) NAME: Douglas J. Ringler 

(B) STREET: 382 Ocean Avenue, #1008 

(C) CITY: Hevere 

(D) STATE/PROVINCE: Massachusetts 

( E ) COUNTRY : U.S.A. 

(F) POSTAL CODE /ZIP: 02151 

( i ) APPLICANT/ INVENTOR: 

(A) NAME: Dominic Picarella 

(B) STREET : 1 2 North Bennet Court, #4 

(C) CITY: Bpston 

(D) STATE/PfcOVINCE: Massachusetts 

(E) COUNTRY: U.S.A. 

(F) POSTAL CODE /ZIP: 02113 



(i) APPLICANT/ INVENTOR: 

K (A) NAME: Walter Newman 

\ v \^\ (B) STREET: \3 Durham Street, #3 

"A /V- > ( C ) CITY : Boston 

XI./ (D) STATE/PRpVINCE: Massachusetts 

(E) COUNTRY :[ U.S. A. 

/ (F) POSTAL cbDE/ZIP: 02115 



(ii) TITLE OF INVENTION: Mucosal Vascular Addressins and Uses 
Thereof 

(iii) NUMBER OF SEQUENCES: 13 

i 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Hamilton, Brook, Smith & Reynolds, P.C. 

(B) STREET: Two Militia Drive 

(C) CITY: Lexington 

(D) STATE: Massachusetts 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02173-4799 
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(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/523,004 

(B) FILING DATE: 01-SEP-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/386,857 

(B) FILING DATE: 10-FEB-1995 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Brook, David E. 

(B) REGISTRATION NUMBER: 22,592 

(C) REFERENCE /DOCKET NUMBER: LKS94-04A2 PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-861-6240 

(B) TELEFAX: 617-861-9540 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. . 1218 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG GAT TTC GGA CTG GCC CTC CTG CTG GCG GGG CTT CTG GGG CTC CTC 48 
Met Asp Phe Gly Leu Ala Leu Leu Leu Ala Gly Leu Leu Gly Leu Leu 
15 10 15 

CTC GGC CAG TCC CTC CAG GTG AAG CCC CTG CAG GTG GAG CCC CCG GAG 9 6 

Leu Gly Gin Ser Leu Gin Val Lys Pro Leu Gin Val Glu Pro Pro Glu 
20 25 30 

CCG GTG GTG GCC GTG GCC TTG GGC GCC TCG CGC CAG CTC ACC TGC CGC 144 
Pro Val Val Ala Val Ala Leu Gly Ala Ser Arg Gin Leu Thr Cys Arq 
35 40 45 
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CTG GCC TGC GCG GAC CGC GGG GCC TCG GTG CAG TGG CGG GGC CTG GAC 192 
Leu Ala Cys Ala Asp Arg Gly Ala Ser Val Gin Trp Arg Gly Leu Asp 
50 55 60 

ACC AGC CTG GGC GCG GTG CAG TCG GAC ACG GGC CGC AGC GTC CTC ACC 240 
Thr Ser Leu Gly Ala Val Gin Ser Asp Thr Gly Arg Ser Val Leu Thr 
65 70 75 80 

GTG CGC AAC GCC TCG CTG TCG GCG GCC GGG ACC CGC GTG TGC GTG GGC 2 88 

Val Arg Asn Ala Ser Leu Ser Ala Ala Gly Thr Arg Val Cys Val Gly 
85 90 95 

TCC TGC GGG GGC CGC ACC TTC CAG CAC ACC GTG CAG CTC CTT GTG TAC 336 
Ser Cys Gly Gly Arg Thr Phe Gin His Thr Val Gin Leu Leu Val Tyr 
100 105 110 

GCC TTC CCG GAC CAG CTG ACC GTC TCC CCA GCA GCC CTG GTG CCT GGT 384 
Ala Phe Pro Asp Gin Leu Thr Val Ser Pro Ala Ala Leu Val Pro Gly 
115 120 125 

GAC CCG GAG GTG GCC TGT ACG GCC CAC AAA GTC ACG CCC GTG GAC CCC 432 
Asp Pro Glu Val Ala Cys Thr Ala His Lys Val Thr Pro Val Asp Pro 
130 135 140 

AAC GCG CTC TCC TTC TCC CTG CTC GTC GGG GGC CAG GAA CTG GAG GGG 480 
Asn Ala Leu Ser Phe Ser Leu Leu Val Gly Gly Gin Glu Leu Glu Glv 
145 150 155 160 

GCG CAA GCC CTG GGC CCG GAG GTG CAG GAG GAG GAG GAG GAG CCC CAG 528 
Ala Gin Ala Leu Gly Pro Glu Val Gin Glu Glu Glu Glu Glu Pro Gin 
165 170 175 

GGG GAC GAG GAC GTG CTG TTC AGG GTG ACA GAG CGC TGG CGG CTG CCG 5 76 

Gly Asp Glu Asp Val Leu Phe Arg Val Thr Glu Arg Trp Arq Leu Pro 
180 185 190 

CCC CTG GGG ACC CCT GTC CCG CCC GCC CTC TAC TGC CAG GCC ACG ATG 62 4 

Pro Leu Gly Thr Pro Val Pro Pro Ala Leu Tyr Cys Gin Ala Thr Met 
195 200 205 

AGG CTG CCT GGC TTG GAG CTC AGC CAC CGC CAG GCC ATC CCC GTC CTG 672 
Arg Leu Pro Gly Leu Glu Leu Ser His Arg Gin Ala lie Pro Val Leu 
210 215 220 

CAC AGC CCG ACC TCC CCG GAG CCT CCC GAC ACC ACC TCC CCG GAG CCT 720 
His Ser Pro Thr Ser Pro Glu Pro Pro Asp Thr Thr Ser Pro Glu Pro 
225 230 235 240 

CCC AAC ACC ACC TCC CCG GAG TCT CCC GAC ACC ACC TCC CCG GAG TCT 7 68 

Pro Asn Thr Thr Ser Pro Glu Ser Pro Asp Thr Thr Ser Pro Glu Ser 
245 250 255 

CCC GAC ACC ACC TCC CAG GAG CCT CCC GAC ACC ACC TCC CAG GAG CCT 816 
Pro Asp Thr Thr Ser Gin Glu Pro Pro Asp Thr Thr Ser Gin Glu Pro 
260 265 270 

CCC GAC ACC ACC TCC CAG GAG CCT CCC GAC ACC ACC TCC CCG GAG CCT 864 
Pro Asp Thr Thr Ser Gin Glu Pro Pro Asp Thr Thr Ser Pro Glu Pro 
2 75 280 285 
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CCC GAC AAG ACC TCC CCG GAG CCC GCC CCC CAG CAG GGC TCC ACA CAC 912 

Pro Asp Lys Thr Ser Pro Glu Pro Ala Pro Gin Gin Gly Ser Thr His 

290 295 300 

ACC CCC AGG AGC CCA GGC TCC ACC AGG ACT CGC CGC CCT GAG ATC TCC 960 
Thr Pro Arg Ser Pro Gly Ser Thr Arg Thr Arg Arg Pro Glu lie Ser 
305 310 315 320 



CAG GCT GGG CCC ACG CAG GGA GAA GTG ATC CCA ACA GGC TCG TCC AAA 
Gin Ala Gly Pro Thr Gin Gly Glu Val lie Pro Thr Gly Ser Ser Lys 
325 330 335 



CTT CTG CCC CAG GTG TCG GCC TGG GCT GGG TTA AGG GGG ACC GGC CAG 
Leu Leu Pro Gin Val Ser Ala Trp Ala Gly Leu Arg Gly Thr Gly Gin 
385 390 395 400 



1008 



CCT GCG GGT GAC CAG CTG CCC GCG GCT CTG TGG ACC AGC AGT GCG GTG 105 6 

Pro Ala Gly Asp Gin Leu Pro Ala Ala Leu Trp Thr Ser Ser Ala Val 
340 345 350 

CTG GGA CTG CTG CTC CTG GCC TTG CCC ACG TAT CAC CTC TGG AAA CGC 1104 
Leu Gly Leu Leu Leu Leu Ala Leu Pro Thr Tyr His Leu Trp Lys Arg 
355 360 365 

TGC CGG CAC CTG GCT GAG GAC GAC ACC CAC CCA CCA GCT TCT CTG AGG 1152 
Cys Arg His Leu Ala Glu Asp Asp Thr His Pro Pro Ala Ser Leu Arq 
370 375 380 



1200 



GTC GGG ATC AGC CCC TCC TGAGTGGCCA GCCTTTCCCC CTGTGAAAGC 12 48 
Val Gly lie Ser Pro Ser 
405 

AAAATAGCTT GGACCCCTTC AAGTTGAGAA CTGGTCAGGG CAAACCTGCC TCCCATTCTA 1308 

CTCAAAGTCA TCCCTCTGCT CACAGAGATG GATGCATGTT CTGATTGCCT CTTTGGAGAA 13 68 

GCTCATCAGA AACTCAAAAG AAGGCCACTG TTTGTCTCAC CTACCCATGA CCTGAAGCCC 142 8 

CTCCCTGAGT GGTCCCCACC TTTCTGGACG GAACCACGTA CTTTTTACAT ACATTGATTC 1488 

ATGTCTCACG TCTCCCTAAA AATGCGTAAG ACCAAGCTGT GCCCTGACCA CCCTGGGCCC 1548 

CTGTCGTCAG GACCTCCTGA GGCTTTGGCA AATAAACCTC CTAAAATGAT AAAAAAAAAA 1608 

AAAAAAAAAA AAAAAA 1624 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asp Phe Gly Leu Ala Leu Leu Leu Ala Gly Leu Leu Gly Leu Leu 
1 5 10 15 
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Leu Gly Gin Ser Leu Gin Val Lys Pro Leu Gin Val Glu Pro Pro Glu 
20 25 30 

Pro Val Val Ala Val Ala Leu Gly Ala Ser Arg Gin Leu Thr Cys Ara 
35 40 45 

Leu Ala Cys Ala Asp Arg Gly Ala Ser Val Gin Trp Arg Gly Leu Asp 
50 55 60 

Thr Ser Leu Gly Ala Val Gin Ser Asp Thr Gly Arg Ser Val Leu Thr 
65 70 75 80 

Val Arg Asn Ala Ser Leu Ser Ala Ala Gly Thr Arg Val Cys Val Gly 
85 90 95 

Ser Cys Gly Gly Arg Thr Phe Gin His Thr Val Gin Leu Leu Val Tvr 
100 105 HO 

Ala Phe Pro Asp Gin Leu Thr Val Ser Pro Ala Ala Leu Val Pro Glv 
115 120 125 

Asp Pro Glu Val Ala Cys Thr Ala His Lys Val Thr Pro Val Asp Pro 
130 135 140 

Asn Ala Leu Ser Phe Ser Leu Leu Val Gly Gly Gin Glu Leu Glu Glv 
145 150 155 16 5 

Ala Gin Ala Leu Gly Pro Glu Val Gin Glu Glu Glu Glu Glu Pro Gin 
165 170 175 

Gly Asp Glu Asp Val Leu Phe Arg Val Thr Glu Arg Trp Arg Leu Pro 
180 185 190 

Pro Leu Gly Thr Pro Val Pro Pro Ala Leu Tyr Cys Gin Ala Thr Met 
195 200 205 

Arg Leu Pro Gly Leu Glu Leu Ser His Arg Gin Ala He Pro Val Leu 
210 215 220 

His Ser Pro Thr Ser Pro Glu Pro Pro Asp Thr Thr Ser Pro Glu Pro 
225 230 235 240 

Pro Asn Thr Thr Ser Pro Glu Ser Pro Asp Thr Thr Ser Pro Glu Ser 
245 250 255 

Pro Asp Thr Thr Ser Gin Glu Pro Pro Asp Thr Thr Ser Gin Glu Pro 
260 265 270 

Pro Asp Thr Thr Ser Gin Glu Pro Pro Asp Thr Thr Ser Pro Glu Pro 
275 280 285 

Pro Asp Lys Thr Ser Pro Glu Pro Ala Pro Gin Gin Gly Ser Thr His 
290 295 300 

Thr Pro Arg Ser Pro Gly Ser Thr Arg Thr Arg Arg Pro Glu He Ser 
305 310 315 320 

Gin Ala Gly Pro Thr Gin Gly Glu Val He Pro Thr Gly Ser Ser Lvs 
325 330 335 

Pro Ala Gly Asp Gin Leu Pro Ala Ala Leu Trp Thr Ser Ser Ala Val 
340 345 350 
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Leu Gly Leu Leu Leu Leu Ala Leu Pro Thr Tyr His Leu Trp Lys Arq 
355 360 365 

Cys Arg His Leu Ala Glu Asp Asp Thr His Pro Pro Ala Ser Leu Arq 
370 375 380 

Leu Leu Pro Gin Val Ser Ala Trp Ala Gly Leu Arg Gly Thr Gly Gin 
385 390 395 400 



Val Gly He Ser Pro Ser 
405 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1146 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG GAT TTC GGA CTG GCC CTC CTG CTG GCG GGG CTT CTG GGG CTC CTC 48 

Met Asp Phe Gly Leu Ala Leu Leu Leu Ala Gly Leu Leu Gly Leu Leu 
15 10 15 

CTC GGC CAG TCC CTC CAG GTG AAG CCC CTG CAG GTG GAG CCC CCG GAG 9 6 

Leu Gly Gin Ser Leu Gin Val Lys Pro Leu Gin Val Glu Pro Pro Glu 
20 25 30 

CCG GTG GTG GCC GTG GCC TTG GGC GCC TCG CGC CAG CTC ACC TGC CGC 144 

Pro Val Val Ala Val Ala Leu Gly Ala Ser Arg Gin Leu Thr Cys Arq 
35 40 45 



CTG GCC TGC GCG GAC CGC GGG GCC TCG GTG CAG TGG CGG GGC CTG GAC 19 2 

Leu Ala Cys Ala Asp Arg Gly Ala Ser Val Gin Trp Arg Gly Leu Asp 
50 55 60 

ACC AGC CTG GGC GCG GTG CAG TCG GAC ACG GGC CGC AGC GTC CTC ACC 240 
Thr Ser Leu Gly Ala Val Gin Ser Asp Thr Gly Arg Ser Val Leu Thr 
65 70 75 80 

GTG CGC AAC GCC TCG CTG TCG GCG GCC GGG ACC CGC GTG TGC GTG GGC 288 
Val Arg Asn Ala Ser Leu Ser Ala Ala Gly Thr Arg Val Cys Val Gly 
85 90 95 

TCC TGC GGG GGC CGC ACC TTC CAG CAC ACC GTG CAG CTC CTT GTG TAC 33 6 

Ser Cys Gly Gly Arg Thr Phe Gin His Thr Val Gin Leu Leu Val Tyr 
100 105 HO 
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GCC TTC CCG GAC CAG CTG ACC GTC TCC CCA GCA GCC CTG GTG CCT GGT 384 

Ala Phe Pro Asp Gin Leu Thr Val Ser Pro Ala Ala Leu Val Pro Glv 
115 120 125 

GAC CCG GAG GTG GCC TGT ACG GCC CAC AAA GTC ACG CCC GTG GAC CCC 432 
Asp Pro Glu Val Ala Cys Thr Ala His Lys Val Thr Pro Val Asp Pro 
130 135 1 40 

AAC GCG CTC TCC TTC TCC CTG CTC GTC GGG GGC CAG GAA CTG GAG GGG 480 
Asn Ala Leu Ser Phe Ser Leu Leu Val Gly Gly Gin Glu Leu Glu Glv 
145 150 155 i 6 g 

GCG CAA GCC CTG GGC CCG GAG GTG CAG GAG GAG GAG GAG GAG CCC CAG 528 
Ala Gin Ala Leu Gly Pro Glu Val Gin Glu Glu Glu Glu Glu Pro Gin 
165 170 175 

GGG GAC GAG GAC GTG CTG TTC AGG GTG ACA GAG CGC TGG CGG CTG CCG 5 76 

Gly Asp Glu Asp Val Leu Phe Arg Val Thr Glu Arg Trp Arq Leu Pro 
180 185 190 

CCC CTG GGG ACC CCT GTC CCG CCC GCC CTC TAC TGC CAG GCC ACG ATG 624 
Pro Leu Gly Thr Pro Val Pro Pro Ala Leu Tyr Cys Gin Ala Thr Met 
195 200 205 

AGG CTG CCT GGC TTG GAG CTC AGC CAC CGC CAG GCC ATC CCC GTC CTG 6 72 

Arg Leu Pro Gly Leu Glu Leu Ser His Arg Gin Ala lie Pro Val Leu 
210 215 220 

CAC AGC CCG ACC TCC CCG GAG CCT CCC GAC ACC ACC TCC CCG GAG TCT 
His Ser Pro Thr Ser Pro Glu Pro Pro Asp Thr Thr Ser Pro Glu Ser 
225 230 235 240 

CCC GAC ACC ACC TCC CCG GAG TCT CCC GAC ACC ACC TCC CAG GAG CCT 
Pro Asp Thr Thr Ser Pro Glu Ser Pro Asp Thr Thr Ser Gin Glu Pro 
245 250 255 

CCC GAC ACC ACC TCC CCG GAG CCT CCC GAC AAG ACC TCC CCG GAG CCC 816 
Pro Asp Thr Thr Ser Pro Glu Pro Pro Asp Lys Thr Ser Pro Glu Pro 
260 265 270 

GCC CCC CAG CAG GGC TCC ACA CAC ACC CCC AGG AGC CCA GGC TCC ACC 
Ala Pro Gin Gin Gly Ser Thr His Thr Pro Arg Ser Pro Glv Ser Thr 
275 280 285 

AGG ACT CGC CGC CCT GAG ATC TCC CAG GCT GGG CCC ACG CAG GGA GAA 912 
Arg Thr Arg Arg Pro Glu He Ser Gin Ala Gly Pro Thr Gin Gly Glu 
290 295 300 

GTG ATC CCA ACA GGC TCG TCC AAA CCT GCG GGT GAC CAG CTG CCC GCG 
Val He Pro Thr Gly Ser Ser Lys Pro Ala Gly Asp Gin Leu Pro Ala 
305 310 315 320 

GCT CTG TGG ACC AGC AGT GCG GTG CTG GGA CTG CTG CTC CTG GCC TTG 
Ala Leu Trp Thr Ser Ser Ala Val Leu Gly Leu Leu Leu Leu Ala Leu 
325 330 335 

CCC ACC TAT CAC CTC TGG AAA CGC TGC CGG CAC CTG GCT GAG GAC GAC 105 6 

Pro Thr Tyr His Leu Trp Lys Arg Cys Arg His Leu Ala Glu Asp Asp 
340 345 350 H 



720 



768 



864 



960 



1008 
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ACC CAC CCA CCA GCT TCT CTG AGG CTT CTG CCC CAG GTG TCG GCC TGG 1104 
Thr His Pro Pro Ala Ser Leu Arg Leu Leu Pro Gin Val Ser Ala Trn 
355 360 365 

GCT GGG TTA AGG GGG ACC GGC CAG GTC GGG ATC AGC CCC TCC i 146 
Ala Gly Leu Arg Gly Thr Gly Gin Val Gly lie Ser Pro Ser 
370 375 380 

TGAGTGGCCA GCCTTTCCCC CTGTGAAAGC AAAATAGCTT GGACCCCTTC AAGTTGAGAA 1206 

CTGGTCAGGG CAAACCTGCC TCCCATTCTA CTCAAAGTCA TCCCTCTGTT CACAGAGATG 12 66 

GATGCATGTT CTGATTGCCT CTTTGGAGAA GCTCATCAGA AACTCAAAAG AAGGCCACTG 132 6 

TTTGTCTCAC CTACCCATGA CCTGAAGCCC CTCCCTGAGT GGTCCCCACC TTTCTGGACG 138 6 

GAACCACGTA CTTTTTACAT ACATTGATTC ATGTCTCACG TCTCCCTAAA AATGCGTAAG 1446 

ACCAAGCTGT GCCCTGACCA CCCTGGGCCC CTGTCGTCAG GACCTCCTGA GGCTTTGGCA 1506 

AATAAACCTC CTAAAATGAA AAAAAAAAAA AAA 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 382 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met Asp Phe Gly Leu Ala Leu Leu Leu Ala Gly Leu Leu Gly Leu Leu 
15 10 15 

Leu Gly Gin Ser Leu Gin Val Lys Pro Leu Gin Val Glu Pro Pro Glu 
20 25 30 

Pro Val Val Ala Val Ala Leu Gly Ala Ser Arg Gin Leu Thr Cys Ara 
35 40 45 

Leu Ala Cys Ala Asp Arg Gly Ala Ser Val Gin Trp Arg Gly Leu Asp 
50 55 60 

Thr Ser Leu Gly Ala Val Gin Ser Asp Thr Gly Arg Ser Val Leu Thr 
65 70 75 80 

Val Arg Asn Ala Ser Leu Ser Ala Ala Gly Thr Arg Val Cys Val Gly 
85 90 95 

Ser Cys Gly Gly Arg Thr Phe Gin His Thr Val Gin Leu Leu Val Tyr 
100 105 HO 

Ala Phe Pro Asp Gin Leu Thr Val Ser Pro Ala Ala Leu Val Pro Gly 
115 120 125 
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Asp Pro Glu Val Ala Cys Thr Ala His Lys Val Thr Pro Val Asp Pro 
130 135 140 

Asn Ala Leu Ser Phe Ser Leu Leu Val Gly Gly Gin Glu Leu Glu Gly 
145 150 155 160 

Ala Gin Ala Leu Gly Pro Glu Val Gin Glu Glu Glu Glu Glu Pro Gin 
165 170 175 

Gly Asp Glu Asp Val Leu Phe Arg Val Thr Glu Arg Trp Arg Leu Pro 

180 185 190 

Pro Leu Gly Thr Pro Val Pro Pro Ala Leu Tyr Cys Gin Ala Thr Met 
195 200 205 

Arg Leu Pro Gly Leu Glu Leu Ser His Arg Gin Ala lie Pro Val Leu 
210 215 220 



His Ser Pro Thr Ser Pro Glu Pro Pro Asp Thr Thr Ser Pro Glu Ser 
225 230 235 240 

Pro Asp Thr Thr Ser Pro Glu Ser Pro Asp Thr Thr Ser Gin Glu Pro 
245 250 255 

Pro Asp Thr Thr Ser Pro Glu Pro Pro Asp Lys Thr Ser Pro Glu Pro 
260 265 270 

Ala Pro Gin Gin Gly Ser Thr His Thr Pro Arg Ser Pro Gly Ser Thr 
275 280 285 

Arg Thr Arg Arg Pro Glu lie Ser Gin Ala Gly Pro Thr Gin Gly Glu 
290 295 300 

Val lie Pro Thr Gly Ser Ser Lys Pro Ala Gly Asp Gin Leu Pro Ala 
305 310 315 320 

Ala Leu Trp Thr Ser Ser Ala Val Leu Gly Leu Leu Leu Leu Ala Leu 
325 330 335 

Pro Thr Tyr His Leu Trp Lys Arg Cys Arg His Leu Ala Glu Asp Asp 
340 345 350 

Thr His Pro Pro Ala Ser Leu Arg Leu Leu Pro Gin Val Ser Ala Trp 
355 360 365 

Ala Gly Leu Arg Gly Thr Gly Gin Val Gly lie Ser Pro Ser 
370 375 380 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1721 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 4 . . 1038 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

AGC ATG GAT CGG GGC CTG GCC CTC CTG CTG GCG GGG CTT CTG GGG CTC 
Met Asp Arg Gly Leu Ala Leu Leu Leu Ala Gly Leu Leu Gly Leu 
15 10 15 

CTC CAG CCG GGC TGC GGC CAG TCC CTC CAG GTG AAG CCC CTG CAG GTG 
Leu Gin Pro Gly Cys Gly Gin Ser Leu Gin Val Lys Pro Leu Gin Val 
20 25 30 

GAG CCC CCG GAG CCG GTG GTG GCC GTG GCC CTG GGC GCC TCT CGC CAG 
Glu Pro Pro Glu Pro Val Val Ala Val Ala Leu Gly Ala Ser Arg Gin 
35 40 45 



CTC ACC TGC CGC CTG GAC TGC GCG GAC CGC GGG GCC ACG GTG CAG TGG 192 
Leu Thr Cys Arg Leu Asp Cys Ala Asp Arg Gly Ala Thr Val Gin Trr> 
50 55 60 

CGG GGC CTG GAC ACC AGC CTG GGC GCG GTG CAG TCG GAC GCG GGC CGC 2 40 

Arg Gly Leu Asp Thr Ser Leu Gly Ala Val Gin Ser Asp Ala Gly Ara 
65 70 75 

AGC GTC CTC ACC GTG CGC AAC GCC TCG CTG TCG GCG GCC GGG ACC CGT 288 
Ser Val Leu Thr Val Arg Asn Ala Ser Leu Ser Ala Ala Gly Thr Arg 
80 85 90 95 

GTG TGC GTG GGC TCC TGC GGG GGC CGC ACC TTC CAG CAC ACC GTG CGG 33 6 

Val Cys Val Gly Ser Cys Gly Gly Arg Thr Phe Gin His Thr Val Arq 
100 105 H0 

CTC CTT GTG TAC GCC TTC CCG GAC CAG CTG ACC ATC TCC CCG GCA GCC 384 
Leu Leu Val Tyr Ala Phe Pro Asp Gin Leu Thr He Ser Pro Ala Ala 
115 120 125 

CTG GTG CCT GGT GAC CCG GAG GTG GCC TGT ACG GCC CAC AAA GTC ACG 432 
Leu Val Pro Gly Asp Pro Glu Val Ala Cys Thr Ala His Lys Val Thr 
130 135 140 

CCT GTG GAC CCC AAT GCG CTC TCC TTC TCC CTG CTC CTG GGG GAC CAG 480 
Pro Val Asp Pro Asn Ala Leu Ser Phe Ser Leu Leu Leu Gly Asp Gin 
145 150 155 

GAA CTG GAG GGG GCC CAG GCT CTG GGC CCG GAG GTG GAG GAG GAG GAG 52 8 

Glu Leu Glu Gly Ala Gin Ala Leu Gly Pro Glu Val Glu Glu Glu Glu 
160 165 170 175 

GAG GAG CCC CAG GAG GAG GAG GAC GTG CTG TTC AGG GTG ACA GAG CGC 57 6 

Glu Glu Pro Gin Glu Glu Glu Asp Val Leu Phe Arg Val Thr Glu Arg 
180 185 190 



TGG CGG CTG CCG ACC CTG GCA ACC CCT GTC CTG CCC GCG CTC TAC TGC 62 4 

Trp Arg Leu Pro Thr Leu Ala Thr Pro Val Leu Pro Ala Leu Tyr Cys 
195 200 205 
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CAG GCC ACG ATG AGG CTG CCT GGC TTG GAG CTC AGC CAC CGC CAG GCC 

Gin Ala Thr Met Arg Leu Pro Gly Leu Glu Leu Ser His Arg Gin Ala 

210 215 220 

ATC CCG GTC CTG CAC GGC CCG ACC TCC CGG GAG CCC CCC GAC ACG ACC 
lie Pro Val Leu His Gly Pro Thr Ser Arg Glu Pro Pro Asp Thr Thr 
225 230 235 

TCC CCG GAA CCC CGG GCC GCG ACC TCC CCG GAG ACC ACC CCC CAG CAG 
Ser Pro Glu Pro Arg Ala Ala Thr Ser Pro Glu Thr Thr Pro Gin Gin 
240 245 250 255 

GGC TCC ACA CAC AGC CCC AGG AGC CCG GGC TCT ACC AGG ACT TGC CGC 
Gly Ser Thr His Ser Pro Arg Ser Pro Gly Ser Thr Arg Thr Cys Arg 
260 265 270 

CCT GAG ATC TCC CAG GCT GGG CCC ACG CAG GGA GAA GTG ATC CCA ACA 
Pro Glu lie Ser Gin Ala Gly Pro Thr Gin Gly Glu Val lie Pro Thr 
275 280 285 



GGC TCG TCC AAA CCT ACG GGT GAC CAG CTG CCC GCG GCT CTG TGG ACC 

Gly Ser Ser Lys Pro Thr Gly Asp Gin Leu Pro Ala Ala Leu Trp Thr 
290 295 300 

AGC AGT GCG GTG CTG GGA CTG CTG CTC CTG GCT TTG CCC ACC TAC CAC 

Ser Ser Ala Val Leu Gly Leu Leu Leu Leu Ala Leu Pro Thr Tyr His 
305 310 315 

CTC TGG AAA CGT TGC CGG CAC CTG GCT GAG GAC GGC GCC CAC CCA CCA 

Leu Trp Lys Arg Cys Arg His Leu Ala Glu Asp Gly Ala His Pro Pro 

320 325 330 335 

GCT TCT CTG AGT AGC CAG CCC TTC CCC CTG TGAAGGGAAA ATAGGTTGGA 
Ala Ser Leu Ser Ser Gin Pro Phe Pro Leu 





340 




345 






CCCCTTCAAG 


CTGAGAACTG 


GTCGGGGCAA 


ACCTGCCTCC 


CATTCTATTC 


AAAGTCATCG 


CTCTGGTCAC 


AGAGAGGGAC 


GCACATTCTG 


ATTGCCTCCT 


TTGGAAAGGC 


TCATCAGAAA 


CTCAAAAGAA 


GGTGATCGTT 


TGTCCCGCCT 


ACCCGTGACC 


TGGAAGCCCC 


CGCCCCGCTC 


GAGTGACCCC 


TGACTTTCTG 


GACGGAACCA 


ACGTACTTCT 


TACATATATT 


GATTCATGTG 


TCATATCTCC 


CTAAAATGCG 


TAAAACCAGC 


TGTGCCCCGA 


CCACCTTGGG 


CCCCTGCCAT 


CAGGACCTCC 


TGAGGCTTTG 


GCAAATAAAC 


CTCCTAAAAG 


GATAGAAACT 


GAAACTTGTG 


GCCGGGCGCG 


GTGGCTCAAG 


CCTGTAATCC 


CAGCACTTTG 


GG AGG CCG AG 


GTGGGTGGAT 


CACGAGGTCA 


GGAGATCGAG 


ACCATCCTGG 


CTAACCCGTG 


AAACCCCGTC 


TCTACTAAAA 


AAATACAAAA 


ATTAGCCGGG 


AGCGGTGGCG 


GGCGCCTGTA 


GTCCCAGCTA 


CTCGGGAGGC 


TGAGGCAGGA 


GAATGGCGTG 


AACCCGGGAG 


GCGGAGCTTG 


CAGTGAGCTG 


AGATCCGGCC 


ACTGCACTCC 


AGCCTGGGGG 


ACAGAGCGAG 


ACTCCGTCTC 


AAAAAAAAAA 


AAAAAAAAAA 



AAA 



672 
720 
768 
816 
864 

912 

960 

1008 

1058 

1118 
1178 
1238 
1298 
1358 
1418 
1478 
1538 
1598 
1658 
1718 
1721 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Asp Arg Gly Leu Ala Leu Leu Leu Ala Gly Leu Leu Gly Leu Leu 

15 10 15 

Gin Pro Gly Cys Gly Gin Ser Leu Gin Val Lys Pro Leu Gin Val Glu 

20 25 30 

Pro Pro Glu Pro Val Val Ala Val Ala Leu Gly Ala Ser Arg Gin Leu 

35 40 45 



Thr Cys Arg Leu 
50 



Gly Leu Asp Thr 
65 



Val Leu Thr Val 



Cys Val Gly Ser 
100 



Leu Val Tyr Ala 
115 



Val Pro Gly Asp 
130 



Val Asp Pro Asn 
145 



Leu Glu Gly Ala 



Glu Pro Gin Glu 
180 



Arg Leu Pro Thr 
195 



Ala Thr Met Arg 
210 



Pro Val Leu His 
225 



Pro Glu Pro Arg 



Asp Cys Ala Asp 
55 

Ser Leu Gly Ala 
70 

Arg Asn Ala Ser 
85 

Cys Gly Gly Arg 



Phe Pro Asp Gin 
120 

Pro Glu Val Ala 
135 

Ala Leu Ser Phe 
150 

Gin Ala Leu Gly 
165 

Glu Glu Asp Val 



Leu Ala Thr Pro 
200 

Leu Pro Gly Leu 
215 

Gly Pro Thr Ser 
230 

Ala Ala Thr Ser 
245 



Arg Gly Ala Thr 
60 

Val Gin Ser Asp 
75 

Leu Ser Ala Ala 

90 

Thr Phe Gin His 
105 

Leu Thr lie Ser 



Cys Thr Ala His 
140 

Ser Leu Leu Leu 
155 

Pro Glu Val Glu 
170 

Leu Phe Arg Val 
185 

Val Leu Pro Ala 



Glu Leu Ser His 
220 

Arg Glu Pro Pro 
235 

Pro Glu Thr Thr 
250 



Val Gin Trp Arg 



Ala Gly Arg Ser 
80 

Gly Thr Arg Val 
95 



Thr Val Arg Leu 
110 

Pro Ala Ala Leu 
125 

Lys Val Thr Pro 



Gly Asp Gin Glu 
160 

Glu Glu Glu Glu 
175 

Thr Glu Arg Trp 
190 



Leu Tyr Cys Gin 
205 

Arg Gin Ala lie 



Asp Thr Thr Ser 
240 

Pro Gin Gin Gly 
255 
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Ser Thr His Ser Pro Arg Ser Pro Gly Ser Thr Arg Thr Cys Arq Pro 
260 265 270 



e Pro Thr Gl\ 



Glu lie Ser Gin Ala Gly Pro Thr Gin Gly Glu Val II 
275 280 285 

Ser Ser Lys Pro Thr Gly Asp Gin Leu Pro Ala Ala Leu Trp Thr Ser 
290 295 300 

Ser Ala Val Leu Gly Leu Leu Leu Leu Ala Leu Pro Thr Tyr His Leu 



310 



315 



320 



Trp Lys Arg Cys Arg His Leu Ala Glu Asp Gly Ala His Pro Pro Ala 
325 330 



335 



Ser Leu Ser Ser Gin Pro Phe Pro Leu 
340 345 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CTCTACTGCC AGGCCACG 

18 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AGCCTGGGAG ATCTCAGGG 

19 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GCCACGATGA GGCTGCCTGG 

20 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTGGAGCCTG GGCTCCTGGG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 
GGAAGCTTCC ACCATGGATT TCGGACTGGC CC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
S (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCGACTAGTG TCGGGCTGTG CAGGAC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GGACTAGTGG TTTGGACGAG CCTGTTG 



27 



